Anti-Trust Rank: Fighting Web Spam
Abstract
The Web is both an excellent medium for sharing information and an attractive platform for delivering products and services. This platform is, to some extent, mediated by search engines in order to meet the needs of users seeking information. Search engines are the “dragons” that keep a valuable treasure: information [8]. Given the vast amount of information available on the Web, it is customary to answer queries with only a small set of results (typically 10 or 15 pages at most). Search engines must therefore rank Web pages in order to create a short list of high-quality results for users. Web spam can significantly degrade the quality of search engine results, so commercial search engines have a strong incentive to detect spam pages efficiently and accurately. Here we present the main techniques recently introduced for Web spam detection.
Similar resources
Propagating Both Trust and Distrust with Target Differentiation for Combating Web Spam
Propagating trust/distrust from a set of seed (good/bad) pages to the entire Web has been widely used to combat Web spam. It has been mentioned that a combined use of good and bad seeds can lead to better results. However, little work is known to have realized this insight successfully. A serious issue with existing algorithms is that trust/distrust is propagated in non-differential ways. However...
Anti-Trust Rank for Detection of Web Spam and Seed Set Expansion
In recent times, the Web has been the most popular and perhaps the most efficient platform for sharing, storing, and retrieving information. Finding the required information on the Web is facilitated by search engines. Search engines form the interface between the Web and the users. Given the vast amount of information available on the Web, search engines must pick a small subset of...
Web Spam Detection with Anti-Trust Rank
Spam pages on the web use various techniques to artificially achieve high rankings in search engine results. Human experts can do a good job of identifying spam pages and pages whose information is of dubious quality, but it is practically infeasible to use human effort for a large number of pages. Similar to the approach in [1], we propose a method of selecting a seed set of pages to be evalua...
Link-Based Similarity Search to Fight Web Spam
We investigate the usability of similarity search in fighting Web spam based on the assumption that an unknown spam page is more similar to certain known spam pages than to honest pages. In order to be successful, search engine spam never appears in isolation: we observe link farms and alliances for the sole purpose of search engine ranking manipulation. The artificial nature and strong inside ...
Fighting Corruption with e-Government Applications
A well-planned e-government strategy can make leaps into building a more efficient, accountable and transparent government. If planned with representation from key stakeholders, e-government applications can rebuild citizen trust in government, promote economic growth by improving interface with business, and empower citizens to participate in advancing good governance. While e-government is no...